Efficient quantum circuits for arbitrary sparse unitaries 
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Arbitrary exponentially large unitaries cannot be implemented efficiently by quantum circuits. 
However, we show that quantum circuits can efficiently implement any unitary provided it has 
at most polynomially many nonzero entries in any row or column, and these entries are efficiently 
computable. One can formulate a model of computation based on the composition of sparse unitaries 
which includes the quantum Turing machine model, the quantum circuit model, anyonic models, 
permutational quantum computation, and discrete time quantum walks as special cases. Thus we 
obtain a simple unified proof that these models are all contained in BQP. Furthermore our general 
method for implementing sparse unitaries simplifies several existing quantum algorithms. 

PACS numbers: 



I. QUANTUM ALGORITHM 

Following ^ we call an N x N matrix V combinatori- 
ally block diagonal if there exists a permutation matrix P 
such that PVP~^ is block diagonal and the largest block 
is dxd for d — polylog(7V). We say V is row-computable 
if there is a polynomial-time algorithm which, given a row 
index i G {1,2,..., N}, computes the values of all of the 
nonzero matrix elements in row i of V. In [l| Aharonov 
and Ta-Shma show that quantum circuits can efficiently 
implement all row-computable combinatorially block di- 
agonal unitaries. More precisely, one can implement 
U satisfying \\U — V\\ < e using poly(log A^, d, log(l/e)) 
quantum gates. As a corollary to this, quantum circuits 
can also efficiently simulate time evolution according to 
combinatorially block diagonal Hamiltonians. That is, 
one can implement U satisfying ||C/ — e~'^*|| < e using 
poly (log A^, d, log(l/e)) gates Ij. 

We call a matrix row-sparse if each row has at most 
polynomially many nonzero entries. A row-sparse Hamil- 
tonian can always be written as a sum of polynomi- 
ally many combinatorially block diagonal Hamiltoni- 
ans. Thus, using the Trotter formula, one can im- 
plement the time evolution e~'^* for any N x N d- 
sparse Hamiltonian H to precision e using a quantum 
circuit of po\y{\ogN,d,\\Ht\\,l/e) gates[l|. Subsequent 
work has improved the efficiency of such Hamiltonian 
simulations This suggests the question of whether 
sparse unitaries can generically be implemented by quan- 
tum circuits of polynomially many gates. As shown be- 
low, the answer is yes. 

Let U he a.n N X N unitary such that each row or 
column has at most d nonzero entries, and these entries 
are efficiently computable. Adapting ideas from Q we 
let 



H = 



U 

W 



(1) 



It is easy to see that iJ is a row-sparse, row-computable 
Hermitian matrix, = 1, and ||iJ|l = 1. Because — 



1 it follows that 



-iHe 



cos(6')l -isin{e)H. 
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As discussed above, the time evolution e~*^* induced by 
any row-sparse row-computable N x N Hamiltonian H 
can be implemented to precision e by a quantum circuit of 
poly (log A^, d, ||iJi||,l/e) gates. By equation 2, choosing 
t ~ tt/2 yields 



iHt 
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Thus, 
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(3) 
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The global phase is irrelevant. The action |1) |0) 
on the ancilla qubit can be made into the identity by 
adding an initial NOT gate. 

This technique works for all sparse efficiently com- 
putable unitaries, but it is not restricted to sparse uni- 
taries. It works for any U such that H, as given by 
equation [1] can be efficiently simulated. Recent results 
of Childs^] show, among other things, how to efficiently 
simulate any Hamiltonian H whose entrywise absolute 
value has at most polynomially large operator norm. By 
our results, this implies efficient quantum circuits for all 
unitaries satisfying the same condition. Furthermore, 
any future advances in Hamiltonian simulation will im- 
ply, via the technique given here, a corresponding ad- 
vancement in efficient quantum circuit implementation 
of unitaries. 

Simple counting arguments show that arbitrary (non- 
sparse) unitaries on n qubits require exponentially many 
quantum gates to implement [6] . A simple procedure us- 
ing 0(2^") quantum gates to implement an arbitrary uni- 
tary is given in section 4.5 of 6^. Far more sophisticated 
techniques have subsequently been devised to construct 
more efficient (but still exponentially large) quantum cir- 
cuits implementing arbitrary unitaries on n qubits f^-[l^. 
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II. APPLICATIONS 

We can formulate a model of quantum computation 
based on sparse unitary matrices. On n qubits, we con- 
sider any row-sparse, column-sparse, row-computable, 
and column-computable unitary to be implementable 
with unit cost. By the results above, the computa- 
tions achievable with polynomial cost in this model can 
all be simulated efficiently by quantum circuits. That 
is, the problems solvable in polynomial time in this 
model are all contained in BQP. Conversely, quantum 
gates are row-sparse, column-sparse, row-computable, 
and column-computable unitaries due to their tensor 
product structure. Thus the sparse unitary model is 
equivalent to BQP. 

We next show that the quantum Turing machine 
model, anyonic models, permutational quantum compu- 
tation, and discrete time quantum walks all lie within the 
sparse unitary model of quantum computation. Thus we 
obtain as an immediate corollary to our efficient imple- 
mentation for sparse unitaries the fact that all of these 
models are contained in BQP. Previously, these contain- 
ments were each proved by different methods, some fairly 
complicated[i[li-[ll|. 

A. Quantum Turing Machines 

Following a quantum Turing machine consists of 
a two-way infinite tape such that each site has a finite 
alphabet S of states, and a head that moves along the 
tape one step at a time, manipulating the states of the 
sites and transitioning between a finite set Q of internal 
states. The location of the head, the strings written on 
the tape, and the internal state of the head are all allowed 
to go into superposition. The dynamics of the quantum 
Turing machine is determined by a transition rule A : 
QxExQxEx {L, i?} — >• C. If g e Q is the current 
state of the head, cr e S is the letter on the tape at 
the current location of the head, and d S {L,i?}, then 
A{q,a,q' ,a' ,d) is the amplitude for the head to change 
the letter on the tape to a' , transition into the internal 
state q' , and move one step left or right on the tape 
depending on the value of d. (In a slight variant, some 
definitions also allow the head to stay in place.) A is 
not allowed to be arbitrary. The evolution it induces 
on the combined state space of the head and tape must 
be unitary. Furthermore, to prevent the "smuggling" of 
uncomputable quantities into the model, the amplitudes 
are required to be efficiently computable. 

Let [/a be the unitary transition matrix induced by 
applying the transition rule defined by A once. If we 
consider the entire infinite tape, then Ua is infinite- 
dimensional. However, after t computational steps, the 
number of tape locations that can have been accessed is 
at most 2t + 1. Thus, the dynamics at each of the first 
t steps is completely captured by an m x m truncated 
matrix J7^*^ with m = {2t + l)|(3||S]p+^ In general, the 



matrix C/^*'' lacks any simple tensor product structure 
or block diagonality. However, it is sparse. Specifically, 
by the rule A, a given configuration has nonzero transi- 
tion amplitude to only 2|I]||(3| other configurations. (Or 
3|I]||(5| if the head is allowed to stay in place.) From 
initial state |a;), the computation performed by the quan- 
tum Turing machine after s < t steps is ^f/^'') l-^)- '^'^ 
simulate this, we can use our general technique for sparse 
unitaries. There is one extra technicality: truncation can 
result in a matrix [7^*^ that is not completely unitary. 
As shown in appendix [XI one can achieve exponentially 
small error by using t/^^*-* rather than f/^^ in the Hamil- 
tonian of equation [T] 



B. Topological and Permutational Computing 

In topological quantum computation, one computates 
by braiding anyons. For example, it is thought that the 
SU{N)k Chern-Simons anyons may arise as quasiparticle 
excitations in certain quantum Hall systems, and these 
may provide a robust way of performing quantum com- 
putation. Braiding n anyons induces a unitary repre- 
sentation of the n-strand braid group -B„. In addition 
to braidin g, o ne can also fuse and split anyons. As de- 
scribed in [l6|.[l8}. the fusion rules provide a natural basis 
for the state preparations prior to braiding and the mea- 
surements at the end of the topological computation. 

The braid group i?„ on n strands is generated by 
CTi, . . . , fT„_i, where aj is the clockwise crossing of the 
neighboring strands j and j + 1. For the SU{N)k Chern- 
Simons anyons, in the basis defined by the anyonic fusion 
rules, the representations of the generators cti, . . . , cr„_i 
are each direct sums of easily computable 2x2 blocks. 
Thus, the representations of these generators are sparse 
row- and column-computable unitaries, so any braid of 
polynomially many crossings can be simulated efficiently 
in the sparse unitary model. Conversely, any quantum 
circuit of polynomially many gates can be simulated by 
a braid of polynomially many crossings [Tsi [lOj . The 
quantum algorithms for estimating Jones and HOMFLY 
polynomials also work by implementing these same rep- 
resentations of Bn using quantum circuits [T3. [20l. |2]|. 

Recently, a model of computation has been proposed 
based on the permutation of spin- 1/2 particles in states 
of definite total angular momentum[l6|. This model is 
closely analogous to topological quantum computation. 
The transformations induced by permuting n particles 
form a unitary representation of the symmetric group 
Sn- The basis for state preparation and measurement 
come from the fusion rules for adding angular momen- 
tum (i.e. for the combination of irreducible representa- 
tions of SU{2)). Sn is generated by si, . . . , s„_i, where 
we imagine Sn permuting n objects arranged on a line, 
and Sj exchanges the j^^ and (j + l)**^ objects. The result- 
ing representations of si, . . . , s„ in the fusion basis has 
the same 2x2 block diagonal structure as in topological 
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quantum computing. These computations thus also fall 
into the sparse unitary framework. 

C. Subgroup Adapted Bases 

The block diagonal structure of the representations of 
the generators of 5„ and i?„ is not a coincidence. This 
structure stems from the fact that the fusion bases are 
subgroup adapted to the chain of subgroups Sn 3 Sn-i D 
. . . D 5*2 in the case of Sn and Bn D Bn-i D . . . D B2 
in the case of B„. As described in [l^ such subgroup 
adapted structure causes the irreducible representations 
of a suitable set of generators for many groups to be 
direct sums of constant or polynomial size matrices. Ef- 
ficient implementations of the resulting sparse unitaries 
are an essential ingredient of many non-Abelian quan- 
tum Fourier transforms including Beals' efficient quan- 
tum Fourier transform over the symmetric group j2J]. 

D. Quantum Walks 



time quantum walks and Turing machines cannot. Dis- 
crete time quantum walks can be efficiently simulated on 
quantum circuits using elementary techniques [lEl ■ How- 
ever in the case of Turing machines, our result on sparse 
unitaries provides a substantial simplification over pre- 
viously known proofs. Beyond this, we hope that the 
sparse unitary model of computation will be useful for 
the discovery of new fast quantum algorithms. 
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Discrete time quantum walks are a quantum ana- 
logue to classical random walks. For example, con- 
sider the standard discrete quantum walk on the torus 
TLjnL. The Hilbert space is spanned by |a;,?), where 
X E {0, 1, 2, . . . , n — 1} arc the sites on the torus, and 
i G {0, 1} is the value of an ancilliary degree of freedom 
called the coin^, HI]- Each step in the walk consists of 
two substeps. First, a Hadamard gate is applied to the 
coin. Then, the walker is moved one step to the right or 
left depending on the value of the coin: 

\x,0) ^ |x-l,0) , . 

\x,l) ^ + 

(Here the additions and subtractions are done modulo n.) 
The above two steps yield a 2n x 2n unitary matrix with 
two nonzero entries in each row or column, and which 
is not combinatorially block diagonal. More generally, 
one can implement a discrete time quantum walk on any 
regular graph of degree d by using a coin of dimension d. 
For constant or polynomial d, each step of the quantum 
walk is the application of a sparse unitary. This places 
quantum walks within the sparse unitary framework. 

III. CONCLUSION 

As shown above, essentially all known discrete models 
of quantum computation are special cases of our sparse 
unitary model. Thus, as an immediate corollary to our 
result we find that all of these models are efficiently simu- 
latable by quantum circuits. Previously the proofs of this 
for the various models were spread over many papers. Al- 
though some of these proofs can be made by appealing to 
the previously known method for implementing combina- 
torially block diagonal unitaries, the proofs for discrete 



Appendix A: Truncation of Turing Machines 

Corresponding to the transition rule A of a quantum 
Turing machine, we define a truncated transition matrix 
C/^*' acting on a finite-dimensional Hilbert space. Specif- 
ically, we define U^^ by its matrix elements as follows. 

{p',q',s'\U^2^\p,q,s)^ 

A(g, s, q', s', L)(5p',p_i -|- A(g, s, q' , s', R)Sp'^p+i 

Here p and p' are head positions, allowed to range from 
—t to t, q and q' are internal states of the head, and s 
and s' are states of a finite tape consisting of the cells 
—t,—t+ l,...,t. In general, U^'^ is not quite unitary 
because the full Turing machine has nonzero amplitudes 
to transition to head positions outside {—t, . . . ,t} and 
these have been cut off. Nevertheless, the corresponding 
Hamiltonian 

[ (u^I^V _ 

is Hermitian and sparse. Because C/^*"* is not unitary, 
equation[2]does not hold. If we start with the head at site 
r and time evolve for time tt/2 with H(2t)j the resulting 
state is 

n=0 

(Al) 

An application of the transition rule A, and hence an 

(2t) 

application of or H(2t)i moves the head at most 

one step to the left or right. Hence, for any r, n < 
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^' J^("2t)k'9o,so) = fl""|r,go,so). Thus in applying 
g-jff(2t)-n-/2 g^j^y state with support only at head 
positions between —t and t, the truncation error only 
appears at orders t and higher in the Taylor series lAll 
Thus, the truncation error ftrunc is 

= iiEH^(^/2)"(^r2t)-^")iV't)ii 

n— t 
n—t 

Applying \\H\\ = 1 and ||-ff(2t)|| < 1 yields 



To simulate the dynamics of the Turing machine for t 
steps, one must apply the unitary transition rule t times. 
If the head starts at location then the state during the 
first t time steps always has support only on head loca- 
tions between —t and t. Thus the above error analysis 
holds at each step, and the total error is therefore at most 
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